Panoramic Video Salient Object Detection with Ambisonic Audio Guidance

Abstract

Video salient object detection (VSOD), as a fundamental computer vision problem, has been extensively discussed in the last decade. However, all existing works focus on addressing the VSOD problem in 2D scenarios. With the rapid development of VR devices, panoramic videos have become a promising alternative to 2D videos for providing an immersive feeling of the real world. In this paper, we aim to tackle the video salient object detection problem for panoramic videos, with their corresponding ambisonic audios. A multimodal fusion module equipped with two pseudo-siamese audio-visual context fusion (ACF) blocks is proposed to effectively conduct audio-visual interaction. The ACF block, equipped with spherical positional encoding, enables fusion in the 3D context to capture the spatial correspondence between pixels and sound sources from the equirectangular frames and ambisonic audios. Experimental results verify the effectiveness of our proposed components and demonstrate that our method achieves state-of-the-art performance on the ASOD60K dataset.
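The abstract relies on relating pixels of an equirectangular frame to directions in 3D space. As a rough illustration only, and not the paper's spherical positional encoding (whose exact form is not given here), the sketch below maps a pixel to a unit vector on the sphere. The function names, the axis convention (z up, longitude increasing left to right) and the pixel-centre offset are all assumptions made for this example.

```python
import math


def equirect_pixel_to_unit_vector(row, col, height, width):
    """Map the centre of pixel (row, col) to a 3D unit vector.

    Assumed convention (hypothetical, not from the paper):
    longitude runs from -pi to pi across the width,
    latitude runs from +pi/2 (top) to -pi/2 (bottom) down the height.
    """
    lon = ((col + 0.5) / width) * 2.0 * math.pi - math.pi
    lat = math.pi / 2.0 - ((row + 0.5) / height) * math.pi
    x = math.cos(lat) * math.cos(lon)
    y = math.cos(lat) * math.sin(lon)
    z = math.sin(lat)
    return (x, y, z)


def spherical_position_grid(height, width):
    """Return a height x width nested list of unit vectors, one per pixel."""
    return [
        [equirect_pixel_to_unit_vector(r, c, height, width) for c in range(width)]
        for r in range(height)
    ]


if __name__ == "__main__":
    grid = spherical_position_grid(4, 8)
    # Print the direction assigned to the top-left pixel.
    print(grid[0][0])
```

A grid like this could be fed to a learned or sinusoidal encoding so that pixels that are neighbours on the sphere, including across the left and right image borders, receive similar positional features. That use is a design suggestion, not a description of the authors' method.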

Similar resources

Efficient Co-Salient Video Object Detection Based on Preattentive Processing

Automatic video annotation is a critical step for content-based video retrieval and browsing. Automatically detecting the focus of interest, such as co-occurring objects in video frames, can ease the tedious manual labeling process. However, detecting co-occurring objects that are visually salient in video sequences is a challenging task. In this paper, in order to detect co-salient video ob...

Video Salient Object Detection Using Spatiotemporal Deep Features

This paper presents a method for detecting salient objects in videos where temporal information in addition to spatial information is fully taken into account. Following recent reports on the advantage of deep features over conventional handcrafted features, we propose the SpatioTemporal Deep (STD) feature that utilizes local and global contexts over frames. We also propose the SpatioTemporal C...

Salient Object Detection with Semantic Priors

Salient object detection has increasingly become a popular topic in cognitive and computational sciences, including computer vision and artificial intelligence research. In this paper, we propose integrating semantic priors into the salient object detection process. Our algorithm consists of three basic steps. Firstly, the explicit saliency map is obtained based on the semantic segmentation ref...

Salient Object Detection: A Survey

Detecting and segmenting salient objects in natural scenes, also known as salient object detection, has attracted a lot of focused research in computer vision and has resulted in many applications. However, while many such models exist, a deep understanding of achievements and issues is lacking. We aim to provide a comprehensive review of the recent progress in this field. We situate salient ob...

Automatic Salient Object Detection for Panoramic Images Using Region Growing and Fixation Prediction Model

Saliency detection aims to detect the most attractive objects in images, which has been widely used as a foundation for various multimedia applications. In this paper, we propose a novel salient object detection algorithm for RGB-D images using center-dark channel prior. First, we generate an initial saliency map based on a color saliency map and a depth saliency map of a given RGB-D image. The...

Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2023

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v37i2.25227